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Abstract 

Recently Rubinfeld et al. (ICS 2011, pp. 223-238) proposed a new model of subHnear algorithms 
called local computation algorithms. In this model, a computation problem F may have more than one 
legal solution and each of them consists of many bits. The local computation algorithm for F should 
answer in an online fashion, for any index i, the i*'' bit of some legal solution of F. Further, all the 
answers given by the algorithm should be consistent with at least one solution of F. 

In this work, we continue the study of local computation algorithms. In particular, we develop 
a technique which under certain conditions can be applied to construct local computation algorithms 
that run not only in polylogarithmic time but also in polylogarithmic space. Moreover, these local 
computation algorithms are easily parallelizable and can answer all parallel queries consistently. Our 
main technical tools are pseudorandom numbers with bounded independence and the theory of branching 
processes. 
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1 Introduction 



The classical view of algorithmic analysis, in which the algorithm reads the entire input, performs a com- 
putation and then writes out the entire output, is less applicable in the context of computations on massive 
data sets. To address this difficulty, several alternative models of computation have been adopted, including 
distributed computation as well as various sub-linear time and space models. 

Local computation algorithms (LCAs) were proposed in ll24ll to model the scenario in which inputs to 
and outputs from the algorithms are large, such that writing out the entire output requires an amount of time 
that is unacceptable. On the other hand, only small portions of the output are required at any point in time 
by any specific user. LCAs support queries to the output by the user, such that after each query to a specified 
location i, the LCA outputs the value of the output at location i. LCAs were inspired by and intended as a 
generalization of several models that appear in the literature, including local algorithms, locally decodable 
codes and local reconstruction algorithms. LCAs whose time complexity is efficient in terms of the amount 
of solution requested by the user have been given for various combinatorial and coding theoretic problems. 

One difficulty is that for many computations, more than one output is considered to be valid, yet the 
values returned by the LCA over time must be consistent. Often, the straightforward solutions ask that the 
LCA store intermediate values of the computations in order to maintain consistency for later computations. 
Though standard techniques can be useful for recomputing the values of random coin tosses in a straightfor- 
ward manner, some algorithms (e.g., many greedy algorithms) choose very different solutions based on the 
order of input queries. Thus, though the time requirements of the LCA may be efficient for each query, it 
is not always clear how to bound the storage requirements of the LCA by a function that is sublinear in the 
size of the query history. It is this issue that we focus on in this paper. 

1.1 Our main results 

Before stating our main results, we mention two additional desirable properties of LCAs. Both of these 
properties are achieved in our constructions of LCAs with small storage requirements. The first is that an 
LCA should be query oblivious, that is the outputs of A should not depend on the order of the queries but 
only on the input and the random bits generated on the random tape of A. The second is that the LCA should 
be parallelizable, i.e., that it is able to answer multiple queries simultaneously in a consistent manner. 

All the LCAs given in |[25l suffer from one or more of the following drawbacks: the worst case space 
complexity is linear, the LCA is not query oblivious, and the LCA is not parallelizable. We give new 
techniques to construct LCAs for the problems studied in ||25l which run in polylogarithmic time as well as 
polylogarithmic space. Moreover, all of the LCAs ai^e query oblivious and easily parallelizable. 

Theorem 1.1 (Main Theorem 1 (informal)). There is an LCA for Hypergraph Coloring that runs in poly- 
logarithmic time and space. Moreover, the LCA is query oblivious and parallelizable. 

Theorem 1.2 (Main Theorem 2 (informal)). There is an LCA for Maximal Independent Set that runs in 
polylogarithmic time and space. Moreover, the LCA is query oblivious and parallelizable. 

We remark that following |[25l . analogous techniques can be applied to construct LCAs with all of the 
desirable properties for the radio network problem and fc-CNF problems. 

1.2 Techniques 

There are two main technical obstacle in making the LCAs constructed in |[25l space efficient, query obliv- 
ious and pai^allelizable. The first is that LCAs need to remember all the random bits used in computing 



1 



previous queries. The second issue is more subtle - Il25l give LCAs based on algorithms which use very 
little additional time resources per query as they simulate greedy algorithms. These LCAs output results 
that depend directly on the orders in which queries are fed into the algorithms. 

We address the randomness issue first. The space inefficient LCAs constructed in ll25l for the problems 
of concern to us are probabilistic by nature. Consistency among answers to the queries seems to demand 
that the algorithm keeps track of all random bits used so far, which would incur linear space complexity. 
A simple but very useful observation is that all the computations are local and thus involve a very small 
number of random bits. Therefore we may replace the truly random bits with random variables of limited 
independence. The construction of small sample space A;-wise independent random variables of Alon et 
al. IS allows us to reduce the space complexity from linear to poly logarithmic. This allows us to prove 
our main theorem on the LCA for the maximal independent set problem. It is also an important ingredient 
in constructing our LCA for Hypergraph Coloring. We believe such a technique will be a standard tool in 
future development of LCAs. 

For Hypergraph Coloring, we need to also address the second issue raised above. The original LCA for 
Hypergraph Coloring in |[25l emulates Alon's algorithm Alon's algorithm runs in three phases. During 
the first phase, it colors all vertices in an arbitrary order. Such an algorithm looks "global" in nature and 
it is therefore non-trivial to turn it into an LCA. In ll25l . they use the order of vertices being queried as the 
order of coloring in Alon's algorithm, hence the algorithm needs to store all answers to previous queries and 
requires linear space in computation. 

We take a different approach to overcome this difficulty. Observe that there is some "local" dependency 
among the colors of vertices - namely, the color of any vertex depends only on the colors of at most a 
constant number, say D, other vertices. The colors of these vertices in turn depend on the colors of their 
neighboring vertices, and so on. We can model the hypergraph coloring process by a query tree: Suppose 
the color of vertex x is now queried. Then the root node of the query tree is x, the nodes on the first level are 
the vertices whose colors the color of x depends on. In general, the colors of nodes on level i depends or0 
the colors of nodes on level i + 1. Note that the query tree has degree bound D and moreover, the size of 
the query tree clearly depends on the order in which vertices are colored, since the color of a vertex depends 
only on vertices that are colored before it. In particular, if x is the fe"^ vertex to be colored, then the query 
tree contains at most k vertices. 

An important fact to note is that Alon's algorithm works for any order, in particular, it works for a 
random order. Therefore we can apply the random order method of Nguyen and Onak ll20l : generate a 
random number r € [0, 1], called the rank, and use these ranks to prune the original query tree into a 
random query tree T. Specifically, T is defined recursively: the root of T is still x. A node z is in T if its 
parent node y in the original query tree is in T and r{z) < r{y). Intuitively, a random query tree is small 
and indeed it is surprisingly small ll20l : the expected size of T is a constant! 

Therefore, if we color the vertices in the hypergraph in a random order, the expected number of vertices 
we need to color is only a constant. However, such an "average case" result is insufficient for our LCA 
purpose: what we need is a "worst case" result which tells almost surely how large a random query tree will 
be. In other words, we need a concentration result on the sizes of the random query trees. The previous 
techniques in 1201 l28l do not seem to work in this setting. 

Consider the worst case in which the rank of the root node x is 1. A key observation is, although there 
are D child nodes of x, only the nodes whose ranks are close to 1 are important, as the child nodes with 
smaller ranks will die out quickly. But in expectation there will be very few important nodes! This inspires 

'in fact, they may depend on the colors of some nodes on levels lower than i. However, as we care only about query complexity, 
we will focus on the worst case that the query relations form a tree. 
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us to partition the random query tree into D + 1 levels based on the ranks of the nodes, and analyze the 
sizes of trees on each level using the theory of branching processes. In particular, we apply a quantitative 
bound on the total number of off-springs of a Galton-Watson process 1221 to show that, for any m > 0, with 
probability at least 1 — 1/m^ the size of a random query tree has at most C{D) log^^^ m vertices, where 
C{D) is some constant depending only on D. We conjecture that the upper bound can be further reduced to 
C{D)logm. 

However, the random order approach raise another issue: how do we store the ranks of all vertices? Ob- 
serve that in constructing a random query tree, the actual values of the ranks are never used - only the relative 
orders between vertices matter. This fact together with the fact that all computations are local enables us to 
replace the rank function with some pseudorandom ordering among the vertices, see Section H] for formal 
definition and construction. The space complexity of the pseudorandom ordering is only polylogarithmic, 
thus making the total space complexity of the LCA also polylogarithmic. 

1.3 Other related work 

Locally decodable codes lO which given an encoding of a message, provide quick access to the requested 
bits of the original message, can be viewed as LCAs. Known constructions of LDCs are efficient and use 
small space ll27l . LCAs generalize the reconstruction models described in |[T] |6l |26l |71. These models 
describe scenarios where an input string that has a certain property, such as monotonicity, is assumed to be 
corrupted at a relatively small number of locations. The reconstruction algorithm gives fast query access to 
an uncorrupted version of the string that is close to the original input. Most of the works mentioned are also 
efficient in terms of space. 

In Il24l . it is noted that the model of LCAs is related to local algorithms, studied in the context of 
distributed computing |[T9l [TtI [121 [T3l [141 [TTl [TOl . This is due to a reduction given by Parnas Ron |[23l 
which allows one to construct (sequential) LCAs based on constant round distributed algorithms. Note that 
this relationship does not immediately yield space-efficient local algorithms, nor does it yield sub-linear 
time LCAs when used with parallel or distributed algorithms whose round complexity is O(logn). 

Recent exciting developments in sublinear time algorithms for sparse graph and combinatorial optimiza- 
tion problems have led to new constant time algorithms for approximating the size of a minimum vertex 
cover, maximal matching, maximum matching, minimum dominating set, minimum set cover, packing and 
covering problems (cf. |[23l[T6l[20l[28l ). For example, for Maximal Independent Set, these algorithms con- 
struct a constant-time oracle which for most, but not all, vertices outputs whether or not the vertex is part of 
the independent set. For the above approximation algorithms, it is not necessary to get the correct answer 
for each vertex, but for LCAs, which must work for any sequence of online inputs, the requirements are 
more stringent, thus the techniques are not applicable without modification. 

1.4 Organization 

The rest of the paper is organized as follows. Some preliminaries and notations that we use throughout the 
paper appear in Section|2l We then prove our main technical result, namely the bound on the sizes of random 
query trees in Section |3l In Section |4] we construct pseudorandom orderings with small space. Finally we 
apply the techniques developed in Section [3] and Section H] to construct LCAs for the hypergraph coloring 
problem and the maximal independent set problem in Section |5] and Section |6l respectively. 
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2 Preliminaries 



Unless stated otherwise, all logarithms in this paper are to the base 2. Let 7Z > 1 be a natural number. We 
use [n] to denote the set {1, . . . , n}. 

All graphs in this paper are undirected graphs. Let G = {V, E) be a graph. The distance between two 
vertices u and v in V{G), denoted by dciujv), is the length of a shortest path between the two vertices. 
We write Ng{v) = {m € V{G) : {u, v) € E{G)} to denote the neighboring vertices of v. Furthermore, let 
Nq{v) = N{v) U {v}. Let dciv) denote the degree of a vertex v. 

2.1 Local computation algorithms 

We present our model of local computation algorithms: Let F be a computational problem and x be an input 

to F. Let F(x) = {y I y is a valid solution for input x}. The search problem is to find any y € F{x). 

Definition 2.1 ((t, s, (^)-local algorithms |25l ). Let x and F{x) be defined as above. A {t{n), s{n), 6{n))- 
local computation algorithm ^ is a (randomized) algorithm which implements query access to an arbitrary 
y G F{x) and satisfies the following: A gets a sequence of queries ii,. . . ,iq for any q > and after 
each query ij it must produce an output yi^ satisfying that the outputs , . . . , yi^ are substrings of some 
y G F{x). The probability of success over all q queries must be at least 1 — 6{n). A has access to a 
random tape and local computation memory on which it can perform cuiTcnt computations as well as store 
and retrieve information from previous computations. We assume that the input x, the local computation 
tape and any random bits used are all presented in the RAM word model, i.e., A is given the ability to access 
a word of any of these in one step. The running time of A on any query is at most t{n), which is sublinear in 
n, and the size of the local computation memory of A is at most s{n). Unless stated otherwise, we always 
assume that the error parameter 6{n) is at most some constant, say, 1/3. We say that ^ is a strongly local 
computation algorithm if both t{n) and s{n) are upper bounded by log'^ n for some constant c. 

Two important properties of LCAs are as follows: 

Definition 2.2 (Query oblivious (251). We say an LCA A is query order oblivious (query oblivious for short) 
if the outputs of A do not depend on the order of the queries but depend only on the input and the random 
bits generated on the random tape of A. 

Definition 2.3 (Parallelizable ll25l ). We say an LCA A is parallelizable if A supports parallel queries, that 
is the LCA is able to answer multiple queries simultaneously so that all the answers aie consistent. 

2.2 fc-wise independent random variables 

Let 1 < < 77, be an integer. A distribution D : {0, 1}" — )■ M-*^ is k-wise independent if restricting D 
to any index subset S C [n] of size at most k gives rise to a uniform distribution. A random variable is 
said to be A;-wise independent if its distribution function is fc-wise independent. Recall that the support of 
a distribution D, denoted supp(Z)), is the set of points at which D{x) > 0. We say a discrete distribution 
D is symmetric if D{x) = l/|supp(Z))| for every x G supp(Z)). If a distribution D : {0, 1}"^ — )■ M-" is 
symmetric with |supp(Z))| < 2"^ for some m < n, then we may index the elements in the support of D 
by {0, 1}™ and call m the seed length of the random variable whose distribution is D. We will need the 
following construction of /c-wise independent random variables over {0, 1}" with small symmetric sample 
space. 
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Theorem 2.4 ((31). For every 1 < k < n, there exists a symmetric distribution D : {0, 1}" — > M-" of 
support size at mostn^^^ and is k-wise independent. That is, there is a k-wise independent random variable 
X = (xi, . . . ,Xn) whose seed length is at most 0{klogn). Moreover, for any 1 < i < n, Xi can be 
computed in space 0{k log n). 

3 Bounding the size of a random query tree 
3.1 The problem and our main result 

Consider the following scenario which was first studied by |[20l in the context of constant-time approxi- 
mation algorithms for maximal matching and some other problems. We are given a graph G = {V, E) of 
bounded degree D. A real number r{v) € [0, 1] is assigned independently and uniformly at random to every 
vertex v in the graph. We call this random number the rank of v. Each vertex in the graph G holds an input 
x{v) G R, where the range R is some finite set. A randomized Boolean function F is defined inductively on 
the vertices in the graph such that F{v) is a function of the input x{v) at v as well as the values of F at the 
neighbors w of v for which r{w) < r{v). The main question is, in order to compute F{vo) for any vertex 
vq in G, how many queries to the inputs of the vertices in the graph are needed? 

Here, for the purpose of upper bounding the query complexity, we may assume for simplicity that the 
graph G is L'-regular and furthermore, G is an infinite D-regular tree rooted at vq. It is easy to see that 
making such modifications to G can never decrease the query complexity of computing F{vq). 

Consider the following question. We are given an infinite D-regular tree T rooted at vq. Each node w in 
T is assigned independently and uniformly at random a real number r{w) € [0, 1]. For every node w other 
than Vq in T, let parent (if) denote the parent node of w. We grow a (possibly infinite) subtree T of T rooted 
at V as follows: a node w is in the subtree T if and only if parent(u;) is in T and r{w) < r(parent(u;)) (for 
simplicity we assume all the ranks are distinct real numbers). That is, we start from the root v, add all the 
children of v whose ranks are smaller than that of v to T. We keep growing T in this manner where a node 
li;' G T is a leaf node in T if the ranks of its D children are all larger than r{w'). We call the random tree T 
constructed in this way a query tree and we denote by |r| the random variable that corresponds to the size 
of T. We would like to know what are the typical values of |T|. 

Following EOl |2T1| . we have that, for any node w that is at distance t from the root vq, Pi[w G 
r]=l/(t-i-l)! as such an event happens if and only if the ranks of the t + 1 nodes along the shortest path 
from Vq to w is in monotone decreasing order. It follows from linearity of expectation that the expected 
value of |T| is given by the elegant formula f2T\ 

t=o ^ ' 

which is a constant depending only on the degree bound D. 

Our main result in this section can be regarded as showing that in fact |T| is highly concentrated around 
its mean: 

Theorem 3.1. For any degree bound D > 2, there is a constant C{D) which depends on D only such that 
for all large enough N, 

Pr[|T| > C{D)\og^+^N] < 
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3.2 Breaking the query tree into levels 



A key idea in the proof is to break the query tree into levels and then upper bound the sizes of the subtrees 
on each level separately. First partition the interval [0, 1] into D + 1 sub-intervals: /j := (1 — jj^, 1 — jy^] 
for i = 1,2, . . . , D and Id+i = [0, j^^rj]- We then decompose the query tree T into D + I levels such that 
a node u € T is said to be on level i if r{v) € li. For ease of exposition, in the following we consider the 
worst case that r{vo) € h. Then the vertices of T on level 1 form a tree which we call Ti = r^^^^ rooted at 
vq. The vertices of T on level 2 will in general form a set of trees {Tg^^ . . . , T2™^^ }, where the total number 
of such trees ?n2 is at most D times the number of nodes in Ti (we have only inequality here because some 
of the child nodes in T of the nodes in Ti may fall into levels 2, 3, etc). Finally the nodes on level D + 1 
form a forest {T^^^^, . . . , T^"^^^^^}. Note that all these trees {T^^"'^} are generated by the same stochastic 
process, as the ranks of all nodes in T are i.i.d. random variables. The next lemma shows that each of the 
subtrees on any level is of size 0(log N) with probability at least 1 — 1 /N"^, 

Lemma 3.2. For any 1 < i < D + 1 and any 1 < j < rrii, with probability at least 1 — 1/A^^, \T^^\ = 
0(log N). 

One can see that Theorem 13 . II folio ws directly from Lemma ll!2l Once again we consider the worst case 
that r{vQ) E Ii. By Lemma IT2l the size of Ti is at most 0(log N) with probability at least 1 — 1/A^'^. In 
what follows, we always condition our argument upon that this event happens. Notice that the root of any 
tree on level 2 must have some node in Ti as its parent node; it follows that 1112, the number of trees on level 
2, is at most D times the size of Ti, hence m2 = 0(log N). Now applying Lemma [3^ to each of the m2 
trees on level 2 and assume that the high probability event claimed in Lemma 13.21 happens in each of the 
subtree cases, we get that the total number of nodes at level 2 is at most 0(log^ A^). Once again, any tree 
on level 3 must have some node in either level 1 or level 2 as its parent node, so the total number of trees 
on level 3 is also at most D{0{\og N) + 0(log^ N)) = 0(log^ N). Applying this argument inductively, 
we get that rrii = 0(log*~^ N) for i = 2, 3, . . . , D + 1. Consequently, the total number of nodes at all 

D + 1 levels is at most 0(log N) + 0(log^ A) H h 0(log^+^ A) = 0(log^+^ A), assuming the high 

probability event in Lemma [J!2] holds for all the subtrees in all the levels. By the union bound, this happens 
with probability at least 1 - 0(log^+^ N)/N^ > 1 - l/A^, thus proving Theorem O 

The proof of Lemma requires results in branching processes, in particular the Galton- Watson pro- 
cesses. 

3.3 Galton- Watson processes 

Consider a Galton- Watson process defined by the probability function p := {p^; k = 0, 1, 2, . . .}, with 
Pk ^ and J2kPk — 1- Let f{s) = YlT=oPk^'^'' '^^e generating function of p. For i = 0, 1, . . . , let 
be the number of off-springs in the generation. Clearly Zq = 1 and {Zi : i = 0,1, . . .} form a Markov 
chain. Let m := ¥,[Zi] = J2k ^Pk be the expected number of children of any individual. The classical result 
of the Galton-Watson processes is that the survival probability (namely lim„_j.oo P^i^n > 0]) is zero if and 
only if m < 1. Let Z = Zq + Zi + ■ ■ ■ be the sum of all off-springs in all generations of the Galton-Watson 
process. The following result of Otter is useful in bounding the probability that Z is large. 

Theorem 3.3 (1221). Suppose pq > and that there is a point a > within the circle of convergence of 
f for which af'{a) = f{a). Let a = a/ f{a). Let t = gcd{r : pr > 0}, where gcd stands for greatest 
common divisor Then 
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[o, ifn^l (modi). 

In particular, if the process is non-arithmetic, i.e. gcd{r : > 0} = 1, and ^^"^^^ is finite, then 

Pr[Z = n]= 0(a-"n-3/2), 
and consequently Pr[Z > n] = 0(a~"). 

3.4 Proof of Lemma |331 

To simplify exposition, we prove Lemma [X2l for the case of tree Ti. Recall that Ti is constructed recursively 
as follows: for every child node v of vq in T, we add v to Ti if r{v) < r(wo) and r{v) € Ii. Then for every 
child node v of vq in Ti, we add the child node w of u in T to Ti if r(ii;) < r{v) and r(tt;) € /i. We repeat 
this process until there is no node that can be added to Ti . 

Once again, we work with the worst case that r{vo) = 1. To upper bound the size of Ti, we consider a 
related random process which also grows a subtree of T rooted at vq, and denote it by T[. The process that 
grows T[ is the same as that of Ti except for the following difference: if u G T[ and w is a child node of v 
in T, then we add w to T[ as long as r{w) € h, but give up the requirement that r{w) < r{v). Clearly, we 
always have Ti C T{ and hence |T{| > |Ti|. 

Note that the random process that generates T[ is in fact a Galton-Watson process, as the rank of each 
node in T is independently and uniformly distributed in [0, 1]. Since = 1/{D + 1), the probability 
function is 

P = {(1 - 9)^, (^) 1(1 - ( 2 ) ^'(1 - ■■■^1% 

where q := 1 / {D + 1) is the probability that a child node in T appears in T[ when its parent node is in T[. 
Note that the expected number of children of a node in T{ is Dq = D /{D + 1) < 1, so the tree T{ is a finite 
tree with probability one. 

The generating function of p is 

f{s) = {l-q + qs)^, 

as the probability function {pk} obeys the binomial distribution pk = b{k,D,q). In addition, the conver- 
gence radius of / is p = oo since {p^} has only a finite number of non-zero terms. 

Solving the equation af'{a) = f{a) yields a = q(j)\^ = tj^- It follows that (since D > 2) 

f"{a) = q^D{D - 1) (^l - q + ' > 0, 

hence the coefficient in ([T]) is non-singular. 
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Let a{D) := a/f{a) = l//'(a), then 



l/aP) = /» = -^(^)^-i 

= (1 + 1 ^iD^-l)/(D+l] D 



D"^-!' D + l 
D + l 

. N l/(D + l) p, 

^ \D+l \ ^ 



< 1 + 



where in the third and the fourth steps we use the inequality (see e.g. |[T8l ) that (1 + j)* < e < (1 + \f^^ 
for any positive integer t. This shows that a(D) is a constant greater than 1. 

Now applying Theorem 13.31 to the Gal ton- Watson process which generates T[ (note that t = 1 in our 
case) gives that, for all large enough n, Pr[|T{| = n] < 2'"^" for some constant c. It follows that Pr[|T{j > 
n] < X]£n 2 ^^* — 2~^(") for all large enough n. Hence for all large enough N , with probabihty at least 
1 - l/iV^ iTil < \T[\ = O(logiV). Thiscompletestheproof of Lemma O 



4 Construction of almost A;-wise independent random orderings 

An important observation that enables us to make some of our local algorithms run in polylogaiithmic space 
is the following. In the construction of a random query tree T, we do not need to generate a random real 
number r{v) S [0, 1] independently for each vertex f € T; instead only the relative orderings among the 
vertices in T matter. Indeed, when generating a random query tree, we only compare the ranks between 
a child node w and its parent node v to see if r{w) < r{v); the absolute values of r{w) and r{v) are 
irrelevant and are used only to facilitate our analysis in Section |3] Moreover, since (almost surely) all our 
computations in the local algorithms involve only a very small number of, say at most k, vertices, so instead 
of requiring a random source that generates total independent random ordering among all nodes in the graph, 
any pseudorandom generator that produces k-wise independent random ordering suffices for our purpose. 
We now give the formal definition of such orderings. 

Let m > 1 be an integer. Let V be any set with m elements. For simplicity and without loss of 
generality, we may assume that V = [m]. Let 7^ be a totally ordered set. An ordering of [m] is an injective 
function r : [m] — > TZ. Note that we can project r to an element in the symmetric permutation group Sm in 
a natural way: arrange the elements {r(l), . . . , r{7n)} in Tl in the monotone increasing order and call the 
permutation of [m] coiTcsponding to this ordering the projection of r onto Sm and denote it by Psm"^- In 
general the projection Ps^^^ is not injective. Let r = {rj}jg/ be any family of orderings indexed by /. The 
random ordering Dj. of [m] is a distribution over a family of orderings r. For any integer 2 < k < m, we 
say a random ordering Dj. is k-wise independent if for any subset S C [m] of size k, the restriction of the 
projection onto Sm of Dr over S is uniform over all the k\ possible orderings among the k elements in 5. A 
random ordering Dr is said to e-almost k-wise independent if the statistical distance between D^ is at most 
e from some A;-wise independent random ordering. Note that our definitions of fc-wise independent random 
ordering and almost /c-wise independent random ordering are different from that of A;-wise independent 
permutation and almost /c-wise independent permutation (see e.g. HI), where the latter requires that the 
function to be a permutation (i.e., the domain and the range of the function are the same set). In this 
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section we give a construction of ^-almost A;-wise independent random ordering whose seed length is 
0{k log^ m). In our later applications k = polylogm so the seed length of the almost /c-wise independent 
random ordering is also polylogarithmic. 

Theorem 4.1. Let m > 2 be an integer and let 2 < k < m. Then there is a construction of -^-almost 
k-wise independent random ordering over [m] whose seed length is 0{k\o^ m). 

Proof. For simplicity we assume that m is a power of 2. Let s = 4 log m. We generate s independent copies 
of /c-wise independent random variables Zi, . . . , with each Z^, 1 < £ < s, in {0, 1}™. By Theorem 12.41 
the seed length of each random variable is 0{k log m) and therefore the total space needed to store these 
random seeds is 0{k log^ m). Let these fc-wise independent m-bit random variables be 

Zl = ^1,1, . . . , Zi^m', 
Z2 = Z2,l, . . . , Z2,m] 



Zg -^Sjl ) ■ ■ ■ ; -Zs^m ■ 

Now for every 1 < i < m, we view each r{i)^= z\^iZ2^% ■ ■ ■ Zs,i as an integer in {0, 1, . . . , 2^* — 1} written 
in the s-bit binary representation and use r : [m] — )• {0, 1, . . . , 2^ — 1} as the ranking function to order the 
m elements in the set. We next show that, with probability at least 1 — l/m?, r(l), . . . , r(m) ai^e distinct m 
integers. 

Let 1 < i < J < m be any two distinct indices. For every 1 < £ < s, since . . . , z^^m are /c-wise 
independent and thus also pair-wise independent, it follows that Pr[zf j = z^j] = 1/2. Moreover, as all 
Zi , . . . , are independent, we therefore have 

Pr[r(z) = r{j)] = Pr[2:^^j = Zij for every 1 < £ < s] 

s 

= JJ Vi[zi^i = zij] 
1=1 

= (1/2)^ 

= 1/m^ 

Applying a union bound argument over all (^) distinct pairs of indices gives that with probability at least 
1 — 1/nn?, all these m numbers are distinct. 

Since each Z^, 1 < ^ < s, is a fc-wise independent random variable in {0, 1}'", therefore for any subset 
{ii, . . . of k indices, (r(ii), . . . ,r{ik)) is distributed uniformly over all 2^"^ tuples. By symmetry, 
conditioned on that r(ii ),..., r(ifc) are all distinct, the restriction of the ordering induced by the ranking 
function r to {ii, . . . is completely independent. Finally, since the probability that r(l), . . . ,r{m) 
are not distinct is at most it follows that the random ordering induced by r is ^-almost A:- wise 

independent. □ 

5 LCA for Hypergraph Coloring 

We now apply the technical tools developed in Section[3]and Section|4]to the design and analysis of LCAs. 

Recall that a hypergraph H is a pair H = {V, E) where F is a finite set whose elements are called 
nodes or vertices, and is a family of non-empty subsets of V, called hyperedges. A hypergraph is called 
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k-uniform if each of its hyperedges contains precisely k vertices. A two-coloring of a hypergrapli is a 
mapping c : F — )■ {red, blue} such that no hyperedge in E is monochromatic. If such a coloring exists, 
then we say H is two-colorable. In this paper we assume that each hyperedge in H intersects at most d 
other hyperedges. Let be the number of hyperedges in H. Here and after we think of k and d as fixed 
constants and all asymptotic forms are with respect to N. By the Lovasz Local Lemma (see, e.g. when 
e{d + 1) < 2^~^, the hypergraph H is two-colorable. 

Following |[25l . we let m be the total number of vertices in H. Note that m < kN, so m = 0{N). 
For any vertex x G y, we use £{x) to denote the set of hyperedges x belongs to. For any hypergraph 
H = {V, E), we define a vertex-hyperedge incidence matrix Ai € {0, ij^^^ so that, for every vertex x 
and every hyperedge e, ^Ax,e = 1 if and only if e G £{x). Because we assume both k and d are constants, 
the incidence matrix A4 is necessarily very sparse. Therefore, we further assume that the matrix A4 is 
implemented via linked lists for each row (that is, vertex x) and each column (that is, hyperedge e). 

Let G be the dependency graph of the hyperedges in H. That is, the vertices of the undirected graph G 
are the N hyperedges of H and a hyperedge Ei is connected to another hyperedge Ej in G if Eid Ej / 0. 
It is easy to see that if the input hypergraph is given in the above described representation, then we can find 
all the neighbors of any hyperedge Ei in the dependency graph G (there are at most d of them) in 0(log N) 
time. 

5.1 Overview of Alon's algorithm 

We now give a sketch of Alon's algorithm 121; for a detailed description of the algorithm in the context of 
LCA see lIBl . 

The algorithm runs in three phases. In the first phase, we go over all the vertices in the hypergraph in 
any order and color them in {red, blue} uniformly at random. During this process, if any hyperedge has too 
many vertices (above some threshold) in it are colored in one color and no vertex is colored in the other color, 
then this hyperedge is said to become dangerous. All the uncolored vertices in the dangerous hyperedges are 
then frozen and will be skipped during Phase 1 coloring. A hyperedge is called survived if it does not have 
vertices in both colors at the end of Phase 1. The basic lemma, based on the breakthrough result of Beck IH, 
claims that after Phase 1 , almost surely all connected components of the dependency graph H of survived 
hyperedges are of sizes at most 0(log A^). We then proceed to the second phase of the algorithm which 
repeats the same coloring process (with some different threshold parameter) for each connected component 
and gives rise to connected components of size 0(log log N). Finally in the third phase we perform a brute- 
force search for a valid coloring whose existence is guaranteed by the Lovasz local lemma. As each of the 
connected components in Phase 3 has at most 0(loglog A^) vertices, the running time of each brute force 
seaixh is thus bounded by polylogA^. 

To turn Alon's algorithm into an LCA, Rubinfeld et al. |[25l note that one may take the order that vertices 
are queried as the order to color the vertices and then in Phase 2 and Phase 3 focus only on the connected 
components in which the queried vertex lie. This leads to an LCA with polylogarithmic running time but 
the space complexity can be linear in the worst case (as the algorithm needs to remember the colors of 
all previously queried or colored vertices). In addition, the LCA is not query oblivious and not easily 
parallelizable. 

5.2 New LCA for Hypergraph Coloring 

To remedy these, we add several new ingredients to the LCA in f25\ and achieve an LCA with both time and 
space complexity are polylogarithmic. In addition, the LCA is query oblivious and easily parallelizable. 
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1st ingredient: bounded-degree dependency. We first make use of the following simple fact: the color of 
any fixed vertex in the hypergraph depends only on the colors of a very small number of vertices. Specif- 
ically, if vertex x lies in hyperedges Ei, . . . , Ed' , then the color of x depends only on the colors of all the 
vertices in E'l , . . . , E^' ■ As every hyperedge is A;-uniform and each hyperedge intersects at most d other 
hyperedges, the color of any vertex depends on at most the colors of D = k{d + 1) other vertices. 

2nd ingredient: random permutation. Note that in the first phase of Alon's coloring algorithm, any order 
of the vertices will work. Therefore, we may apply the idea of random ordering in f20]. Specifically, suppose 
we are given a random number generator r : [m] [0,1] which assign a random number uniformly and 
independently to every vertex in the hypergraph. Suppose the queried vertex is x. Then we build a (random) 
query tree T rooted at x using BFS as follows: there ai^e at most D other vertices such that the color of x 
depends on the colors of these vertices. Let y be any of such vertex. If r{y) < r{x), i.e. the random number 
assigned to y is smaller than that of x, then we add y as a child node of x in T. We build the query tree this 
way recursively until there is no child node can be added to T. By Theorem 13.11 with probability at least 
1 — 1/m?, the total number of nodes in T is at most polylogm and is thus also at most polylogA'^. This 
implies that, if we color the vertices in T in the order from bottom to top (that is, we color the leaf nodes 
first, then the parent nodes of the leaf nodes and so on, and color the root node x last), then for any vertex 
X, with probability at least 1 — 1/m? we can follow Alon's algorithm and color at most polylogA^ vertices 
(and ignore all other vertices in the hypergraph) before coloring x. Therefore the running time of the first 
phase of our new LCA is (almost surely) at most polylogA'^. 

3rd ingredient: A:-wise independent random ordering. The random permutation method requires linear 
space to store all the random numbers that have been revealed in previous queries in order to make the 
answers consistent. However, two useful observations enable us to reduce the space complexity of random 
ordering from linear to polylogarithmic. First, only the relative orderings among vertices matter: in building 
the query tree T we only check if r{y) < r{x) but the absolute value of r(x) and r{y) are irrelevant. There- 
fore we can replace the random number generator r with an equivalent random ordering function r G Sm, 
where Sm is the symmetric group on m elements. Second, as the query tree size is at most polylogarith- 
mic almost surely, the random ordering function r need not be totally random but a polylogarithmic-wise 
independent permutation suffice^ Therefore we can use the construction in Theorem 14. II of ^-almost 
fc-wise independent random ordering of all the vertices in the hypergraph with k = polylogA^. The space 
complexity of such a random ordering, or the seed length, is 0{k log^ m) = polylogA^. 

4th ingredient: /c-wise independent random coloring. Finally, the space complexity for storing all the 
random colors assigned to vertices is also linear in worst case. Once again we exploit the fact that all 
computations in LCAs are local to reduce the space complexity. Specifically, the proof of the basic lemma of 
Alon's algorithm (see e.g. IJl Claim 5.7.2]) is valid as long as the random coloring of the vertices is clog A^- 
wise independent, where c is some absolute constant. Therefore we can replace the truly random numbers 
in {0, 1}™ used for coloring with a clog A^-wise independent random numbers in {0, 1}™ constructed in 
Theorem |2.4| thus reducing the space complexity of storing random colors to 0(log^ A^). 

5.3 Pseudocode of the LCA and main result 

To put everything together, we have the following LCA for Hypergraph Coloring as illustrated in Fig. [H 
Fig. |2] and Fig. [3] In the preprocessing stage, the algorithm generates 0{ ^^°fj^j^ ) copies of pseudo-random 

" Since the full query tree has degree bound D, so the total number of nodes queried in building the random query tree T is at 
most D\T\, which is also at most polylogarithmic. 
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LCA for Hypergraph Coloring 

Preprocessing: 

1. generate 0{ ^^°fj^j^ ) copies of clog A^-wise independent random variables in {0, 1} 

2. generate a ^-almost polylogTV-wise independent random ordering over [m] 
Input: a vertex x ^ V 

Output: a color in { red, blue } 

1 . Use BFS to grow a random query tree T rooted at x 

2. Color the vertices in 7" bottom up 

3. If X is colored red or blue, return the color 

Else run Phase 2 Coloring(x) 



Figure 1 : Local computation algorithm for Hypergraph Coloring 

colors for every vertex in the hypergraph and a pseudorandom ordering of all the vertices. To answer each 
query, the LCA runs in three phases. Suppose the color of vertex x is queried. During the first phase, 
the algorithm uses BFS to build a random query tree rooted at x and then follows Alon's algorithm to 
color all the vertices in the query tree. If x gets colored in Phase 1, the algorithm simply returns that 
color; if x is frozen in Phase 1, then Phase 2 coloring is invoked. In the second phase, the algorithm first 
explores the connected components around x of survived hyperedges. Then Alon's algorithm is performed 
again, but this time only on the vertices in the connected component. For some technical reason, the random 
coloring process is repeated 0{ y^°f^j^ ) time^ until a good coloring is found which makes all the surviving 
connected components after Phase 2 very small. If x gets colored in the good coloring, then that color is 
returned; otherwise the algorithm runs the last phase, in which a brute-force seaixh is performed to find the 
color of X. 

The time and space complexity as well as the error bound of the LCA are easy to analyze and we have 
the following main result of LCA for Hypergraph Coloring: 

Theorem 5.1. Let d and k be such that there exist three positive integers ki, k2 and k^ such that the fallow- 
ings hold: 

ki + k2 + k3 = k, 
16d{d-lf{d+l) < 
16d{d-lf{d+l) < 2'=2, 
2e{d+l) < 

Then there exists a (polylogiV, polylogiV, 1 / N)-local computation algorithm which, given a hypergraph H 
and any sequence of queries to the colors of vertices {xi,X2, . . . ,Xs), returns a consistent coloring for all 
Xi 's which agrees with some 2-coloring of H. 

6 LCA for Maximal Independent Set 

Recall that an independent set (IS) of a graph G is a subset of vertices such that no two vertices in the set 
are adjacent. An independent set is called a maximal independent set (MIS) if it is not properly contained in 
any other IS. 

''This is why the algorithm generates many copies of independent pseudorandom colorings at the beginning of the LCA. 
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Phase 2 Coloring(x) 

Input: a vertex x ^ V 

Output: a color in {red, blue] or FAIL 

1 . Start from £{x) to explore G in order to find the connected 

component Ci (x) of survived hyperedges around x 

2. If the size of the component is larger than C2 log N 

Abort and return FAIL 

3. Repeat the following 0{ ^^°fj^j^^ ) times and stop if a good coloring is founcQ 

(a) Color all the vertices in Ci (x) uniformly at random 

(b) Explore the dependency graph of G\s-^{x) 

(c) Check if the coloring is good 

4. If X is colored in the good coloring, return that color 

Else run Phase 3 Coloring(a;) 

" Following 1251 . let 5i (x) be the set of surviving hyperedges in Ci (x) after all vertices in Ci (x) 
are either colored or are frozen. Now we explore the dependency graph of Si{x) to find out all the 
connected components. We say a Phase 2 coloring is good if all connected components in G\si {x) 
have sizes at most C3 log log A'^, where C3 is some absolute constant. 



Figure 2: Local computation algorithm for Hypergraph Coloring: Phase 2 



Phase 3 Coloring(a;) 

Input: a vertex x 

Output: a color in { red, blue } 

1 . Start from £{x) to explore G in order to find the connected 

component of all the survived hyperedges around x 

2. Go over all possible colorings of the connected component 

and color it using a feasible coloring. 

3. Return the color c of x in this coloring. 



Figure 3: Local computation algorithm for Hypergraph Coloring: Phase 3 
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In Il24ll25l . a two-phase LCA is presented for MIS. For completeness, we present the pseudocode of the 
LCA in Appendix El Let G be a graph with maximum degree d and suppose the queried vertex is v. In 
the first phase, the LCA simulates Luby's algorithm for MIS |[T5l . However, instead of running the parallel 
algorithm for O(logn) rounds as the original Luby's algorithm, the LCA simulates the parallel algorithm 
for only 0{dlogd) rounds. Following an argument of Pamas and Ron ll23l . the sequential running time for 
simulating the parallel algorithm to determine whether a given node is in the MIS is If V or any of 

v's neighbors is put into the independent set during the first phase, then the algorithm return "Yes" or "No", 
respectively. If, on the other hand, v lies in some connected component of "surviving" vertices after running 
the first phase, then the algorithm proceeds to the second phase algorithm, in which a simple linear-time 
greedy search for an MIS of the component is performed. A key result proved in |[24l l25l is that, after the 
first phase of the algorithrn, almost surely all connected components have sizes at most 0{poly{d) logn). 
Therefore the running timcl of the second phase is log n. 

To implement such a two-phase LCA and ensure that all answers are consistent, we need to maintain a 
random tape that keeps a record of all the generated random bits during previous runs, which implies the 
space complexity of the LCA is linear in the worst case. To see this, suppose two vertices u and v are 
connected in G and u is queried first. Suppose further that the LCA runs on u and finds out during the first 
phase that u is in the IS. If vertex v is queried at some time later, we need to ensure that, when simulating 
Luby's algorithm u is put in the IS in some round (hence v is deleted in the round after that). This in turn 
requires that we retrieve the random bits used during the run of LCA on u. 

A simple but crucial observation which enables us to reduce the space complexity of the LCA for MIS 
is, since all the computations are "local", we may replace the truly random bits used in the algorithm with 
random bits of limited independence constructed in Theorem 12.41 

First we round the degree bound of G to d = 2 . Note that d < d < 2d. Now we can generate the 
probability l/2d used in Luby's algorithm (c.f. Figure H]) by tossing log d = [log d\ independent fair coins. 

Since the second phase of the LCA is deterministic, we can therefore focus on the first phase only. The 
running time of the first phase is shown to be |[25l . Following the notation in |[25l . for a vertex v 

in G, let Ay be the event that t; is a surviving vertex at the end of Phase 1 and let be the event that v is 
in state "_L" after running MIS^ for 0{d\og d) rounds, where MIS^ is a variant of MIS, a subroutine of 
the first phase algorithm. It was shown in |[25l that C By (Claim 4.2) and for any subset of vertices W, 

Pr[all vertices in W are surviving vertices] 

= Pr[n„eiyA^] 
< Fr[ny(.wBy]. 

Following the proof of Lemma 4.6 in [SSl, a graph H on the vertices V{G) is called a dependency graph 
for {i?i)}t,gy(G) if for ^ event By is mutually independent of all B^ such that (u, v) ^ H. Let 
denote the "distance-3" graph of H, that is, vertices u and v are connected in if their distance in H is 
exactly 3. Let be a subset of vertices in H^. Then, since all vertices in W are at least 3-apart, all the 
events {By}y^]v are mutually independent, it follows that the probability that all vertices in W are surviving 
vertices satisfies 

FT[nyeWBy] = H PT[By]. 

Finally in the proof of Lemma 4.6 in ll25l . the size of W is taken to be ci logn for some constant ci to 

''Note that we need to run a BFS starting from v to explore the connected component in which v lies. Each step of the BPS 
incurs a run on the explored node of the first phase LCA. 
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show that, almost surely all connected components of surviving vertices after Phase 1 are of sizes at most 
poly((i) log n. 

Now we try to replace the true random bits used in the LCA in [1251 with pseudorandom bits of limited 
independence. Firstly, since the running time of the first phase is d'^^^°^^\h&nce this is also the running time 
of the algorithm if the subroutine MIS is replaced with MIS^. It follows that each event depends on at 
most • log d = ci'-^('°S'^) random bits. Secondly, the argument we sketched in the last paragraph is 

still valid as long as the events {B^}^,^jj3 are ci logn-wise independent. Such a condition is satisfied if the 
random bits used in the algorithm ai^e A;-wise independent, where k = • ci logn = log??,. 

Note that the total number of random bits used during the first phase for all vertices is ?ti = ■ n. 

Therefore all we need is a A;-wise independent random variable in {0, 1}™. By Theorem I2.4[ such random 
variables can be constructed with seed length O(fclogm) = log^n and each random bit can be 

computed in time 0{k log m) = d'^('^i°s'^) log^ n. 

To put everything together, we proved the following theorem reg arding the LCA for Misl: 

Theorem 6.1. Let G be an undirected graph with n vertices and maximum degree d. Then there is a 
^Oidlogd) ^Qg3 ^0{logd) ^Qg2 l/n)-local computation algorithm which, on input a vertex v, decides ifv 
is in a maximal independent set. Moreover, the algorithm will give a consistent MIS for every vertex in G. 
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Maximal Independent Set: Phase 1 
Input: a graph G and a vertex v G V 
Output: {"true", "false", "_L"} 
For i from 1 to r = 20d log d 

(a) If MIS(u, i) = "selected" 
return "true" 

(b) Else if MIS(u, i) = "deleted" 
return "false" 

(c) Else 
return "_L" 

MIS{v,i) 

Input: a vertex v V and a round number i 
Output: {"selected", "deleted", "_L"} 

1 . If w is marked "selected" or "deleted" 

return "selected" or "deleted", respectively 

2. For every u in N{v) 

If MIS(m, i-l)= "selected" 

mark v as " deleted" and return "deleted" 

3. V chooses itself independently with probability ^ 

If V chooses itself 

(i) For every u in N{v) 

If u is marked "±", u chooses itself independently with probability 

(ii) If V has a chosen neighbor 
return "_L" 

(iii) Else 

mark v as "selected" and return "selected" 

Else 

return "_L" 



Figure 4: Local computation algorithm for MIS: Phase 1 
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A Pseudocode of the LCA for Maximal Independent Set 

In this section we present the pseudocode of the LCA for Maximal Independent Set. This is taken from ||25l 
with slight modifications and we also refer interested readers to ll25l for detailed description and analysis of 
the algorithm. 
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MISB(w,i) 

Input: a vertex v ^ V and a round number i 
Output: {"picked", "_L"} 

1 . If w is marked "picked" 

return "picked" 

2. V chooses itself independently with probability ^ 

If V chooses itself 

(i) For every u in N{v) 

u chooses itself independently with probability 

(ii) If V has a chosen neighbor 
return "_L" 

(iii) Else 

mark v as "picked" and return "picked" 

Else 

return "±" 



Figure 5: Algorithm MISs 



Maximal Independent Set: Phase 2 
Input: a graph G and a vertex v €V 
Output: { "true", "false" } 

1 . Run BFS starting from v to grow a connected component of surviving vertices 
(If a vertex u is in the BFS tree and w G N{u) in G, then w is in the BFS tree 

if and only if running the first phase LCA on w returns "_L") 

2. (Run the greedy search algorithm on the connected component for an MIS) 

Set 5 = 

Scan all the vertices in the connected component in order 
If a vertex u is not deleted 
add uio S 

delete all the neighbors of u 

3. Ifv&S 

return "true" 
else "false" 



Figure 6: Local computation algorithm for MIS: Phase 2 
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